Under consideration for publication in Theory and Practice of Logic Programming 



Normal Forms for Answer Sets Programming 



■ STEFANIA COSTANTINI 

C D , Dip. d'Informatica, Universita di L'Aquila 

C D ■ hoc. Coppito, L'Aquila, 1-67100 Italy 

£\J ' (e-mail: stefcost@di.univaq.it) 

^— > . 

ALESSANDRO PROVETTI 

^ i Dip. di Fisica, Universita di Messina 

• Sal. Sperone 31, Messina 1-98166 Italy 

(e-mail: ale9unime.it) 



{A 
o 



C/5 

O 



submitted 11 July 2001; revised 11 December 2003. 15 March 2004; accepted 30 September 2004 



Abstract 



Normal forms for logic programs under stable/answer set semantics are introduced. We 
' argue that these forms can simplify the study of program properties, mainly consistency. 

The first normal form, called the kernel of the program, is useful for studying existence and 
number of answer sets. A kernel program is composed of the atoms which are undefined 
in the Well-founded semantics, which are those that directly affect the existence of answer 
sets. The body of rules is composed of negative literals only. Thus, the kernel form tends 
to be significantly more compact than other formulations. Also, it is possible to check 
«j , consistency of kernel programs in terms of colorings of the Extended Dependency Graph 

program representation which we previously developed. The second normal form is called 
3-kernel. A 3-kernel program is composed of the atoms which are undefined in the Well- 
founded semantics. Rules in 3-kernel programs have at most two conditions, and each rule 
either belongs to a cycle, or defines a connection between cycles. 3-kernel programs may 
have positive conditions. The 3-kernel normal form is very useful for the static analysis of 
program consistency, i.e., the syntactic characterization of existence of answer sets. This 
result can be obtained thanks to a novel graph-like representation of programs, called 
Cycle Graph which presented in the companion article ( Costantini 2004b I . 

KEYWORDS: answer set programming, program transformation, normal forms. 



1 Introduction 

Answer Set Programming is based on the Answer Set semantics ( Gclf ond and Lifschitz 1988|) 
QGelfond and Lifschitz 1991)) . which gives a declarative meaning to negation as fail- 
ure and establishes a direct connection to Reiter's Default logic and other relevant 
non-monotonic reasoning formalism. In Answer Set Programming (from now on, 
ASP), solutions to a problem are represented by answer sets, and not by vari- 
able substitutions produced in response to a query, like in traditional logic pro- 
gramming. The Answer Set semantics deals correctly with cyclic negative def- 
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initions, by selecting some of the classical models of the theory (in particular, 
those that are minimal and supported). In contrast, in the Well-founded Semantics 
l|Van Gelder et a~1 990l all atoms involved in negative cycles are deemed undefined. 

Our long-term research objectives, to which this article contributes the founda- 
tions, are: 

1. to study the property of consistency (existence of an answer set) of a program; 

2. to develop some practical model theory for ASP, e.g., finding the syntactic 
features that affect existence and number of answer sets; 

3. to investigate the relationship between the restricted syntax of normal forms 
and the development of fast algorithms for ASP computation. 

In this article we propose two normal forms for logic programs. Both normal 
forms have a uniform restricted syntax, i.e., no facts and few positive conditions. 
Hence the representation of programs is rather simple. 

First, we define the kernel normal form, which forbids positive conditions, facts 
(and their direct consequences) and undefined atoms, i.e., atoms that can be con- 
sidered irrelevant from the point of view of program consistency. 

The study of program consistency, as well as actual model computation can be 
made much easier by the 3-kernel normal form, where the length of rule bodies is 
limited to two literals, and some further simplification of irrelevant atoms and rules 
is made (at the expense of reintroducing some positive conditions). We define a 
simple transformation from kernel to 3-kernel programs. 

The results presented here build on our earlier results on program representation 
and analysis under the answer set semantics. First, (Brign ohet al. 19991 ) proposed 
an approach where programs are represented by directed graphs, and deduction 
algorithms are given in terms of graph coloring. The 3-kernel normal form will 
make the corresponding graphs have a regular and simple structure. Such feature 
greatly helps when checking consistency, as shown in HCostantini 2004b). which 
gives the first purely-syntactic and complete characterization of consistent (w.r.t. 
Answer Sets semantics) logic programs. 

We believe that several, alternative normal forms can be introduced and stud- 
ied to improve the body of technical results and useful transformations for ASP 
programs. The contributions of this article are, in summary: 

1. the definition of the kernel format; 

2. a representation theorem for the kernel form; 

3. the definition of 3-kernel normal form; 

4. a normalization algorithm for 3-kernel form. 

2 The kernel Normal Form 

We assume the standard definitions of (propositional) general logic program (hence- 
forth, program) and of Well-founded ( Va n Gelder et al. 1990J) . stable model (|Gelfond and Lifschitz 1 988 ) 
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and answer set semantics IjGelfond and Lifschitz 199l)l . Whenever we mention con- 
sistency (or stability) conditions, or the Gclfond-Lifschitz transformation, we refer 
to those introduced in (Ge lfond and Lifschitz 1988|l . 

Let LT be a logic program, we denote by WFS(U) the well-founded model of II. 
r(II, S) denotes the application of the Gelfond-Lifschitz operator to II w.r.t. the 
set of literals S. 

Definition 1 

A program LT is WFS-irreducible if and only if WFS{U) = (0, 0). 

That is, in WFS-irreducible programs all the atoms have truth value undefined 
under the Well-founded semantics. 

The atoms that are relevant for deciding whether answer sets exist and finding 
them ( Costantini 1995 ) are exactly those that are deemed undefined under the Well- 
founded semantics. Therefore, we aim for normal forms that are WFS-irreducible. 
Below is the definition of kernel normal form. 

Definition 2 

A logic program LT is in kernel normal form (or, equivalently, LT is a kernel program) 
if and only if the following conditions hold. 

1. n is WFS-irreducible; 

2. every rule has its body composed of negative literals only; 

3. every atom occurring in LT appears in the body of some rule; 

It is easy to see that: (a) in kernel programs there are no facts and (b) every atom 
occurs as the head of some rule and also, by definition, in the body of some rule 
(possibly the same one). Moreover, one can observe that all atoms are either part 
of a cyclic definition or are defined by using atoms that are part of a cycle. In 
other words, all atoms somewhat depend on cycles. This notion is made precise 
and developed in the work of l|Costantini 2f)04bjl . 

Example 1 

Consider the problem of 3-colorability of a graph. Given a graph Q, we need to 
show a complete assignment of nodes to one of three colors (in this case green, red 
and blue) such that no adjacent nodes are assigned to the same color. The program 
below solves the 3-colorability problem and is in kernel normal form. 

Let us name nodes with integers. For each node (here we consider node 0) the 
program must contain these 3 rules, which impose the assignment of a color to the 
considered node: 

color(0, red) <— not color(0, blue), not col or (0, green). 
color(0, blue) <— not color (0, red), not col or (0, green). 
color(0, green) <— not color '(0, blue), not color (0, red). 
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The next set of rules is used to record the color assignments that have not been 
chosen: 

n-color(0, red) <— not color(0, red). 
n-color(0, green) <— not col or (0, green). 
n_co/or(0, blue) <— not color(Q, blue). 

Finally, for each edge of Q we need to add to the program the following set of rules 
(the sample set below is for an edge between vertex and vertex 1): 



edge-ok(0, 


1) + 


- not edge-ok(0, 1). 


edge-ok(0, 


1)* 


- not edge-ko(0, 1). 


edgeJto(0, 


1)* 


- not n_color{Q, red), not n_color(\, red). 


edge_ko(0, 




- not n_color(0, green), not n_color(l, green) 


edgeJco(0, 


1)- 


- not n _color (0, blue), not n_color(\, blue). 



The first two rules are used to impose the truth of edge-ok{0, 1) and, equivalently, 
the falsity of edgeJto{0, l) 1 . 

It remains easy to show that each and all answer sets of the program defined as 
in the schema above contain a set of color atoms, one for each node, from which a 
solution to the original problem, i.e., a suitable 3-coloration of the graph, can be 
read out. Similarly, we can show that if the considered graph admits a 3-coloring 
then the correspondent kernel program defined using the schema above must admit 
a corresponding answer set. 

We are now ready to state and prove the main technical result, i.c, that kernel is a 
normal form. 

3 The proof of normality 

The kernel form is normal in the sense that any logic program under answer set 
semantics admits an equivalent kernel program, i.e., one which has the same answer 
sets, modulo some projection. 

This is a consequence of the following representation Theorem^ An alternative, 
elegant proof has been recently suggested by W. Marek and J. Remmel in a personal 
communication. It consists in showing that their rational reconstruction of Turing 
machines ( M arek and Remmel 20 01) can be kernelized, thus showing that kernel 
programs can encode all problems in the NP class. 

In order to state the Theorem, let us recall the following: 
Fact 1 

The answer sets of any logic program form an anti-chain 2 . 

1 Normally, ASP solvers extend the language of logic programs with abbreviations that allow 
to specify concisely a constraint on the value of a formula. So, the two rules above are often 
abbreviated by the formula <— edgejco(0, 1). 

2 A collection of sets is an anti-chain if no component is subset of another. 
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This fact follows directly from minimality of answer sets. Consistency of the given 
program is immaterial here. As ( |Marek and Trus zczynsk i 1999| ) point out, logic 
programs can be seen as a compact representation of anti-chains. The representation 
theorem follows. For simplicity it is worded for normal programs, i.e., with no 
explicit negation (->). 

Theorem 1 

Let H — {ai, . . . a n } be a set of atoms and let V(H) be its powerset. Let A C V(H) 
be an arbitrary anti-chain over Ji. There exists a kernel logic program that has 
exactly the collection of answer sets A, modulo projection over Ti. 

Proof 

The proof is by construction. 

First, suppose that H contains neither m nor _L; these are special atoms used in 
the construction below. Each component, say A 6 A is a set {a\, . . . , a r } C 7i. Let 
{ni, . . . ,n s } = H \ A be the set of atoms not belonging to A. 

Second, we complete TL by adding to it, for each atom hi EH & fresh atom hi. 

Now, the kernel tt_a is built as follows: 

i) for each atom hi G Ti. we put in 7r^ 

hi <— not hi. 
hi <— not hi. 

ii) for each component, A = {ai, . . . , a r }, of A we put in tta the following rule 

m <— not ai, . . . not a7, not m, . . . not n s . 

iii) Finally we put the consistency axiom: 

_L <— not _!_, not m. 

The intuitive reading is: in order for 7T4 to be consistent, m must be true, so at 
least one of the rules defining m must have all its conditions true. These conditions 
describe exactly one of the answer sets. It is easy to see that, by the anti-chain 
property which is enforced by the introduction of the a7 atoms, no two rules for m 
can fire under the same answer set. 

In fact, suppose that X is an answer set for tta s.t. two rules of ita fire, say 
m <— not Hi, . . . not 57, not rt\, . . . not n s . 

m <— not a'i, . .. not a' tl not . . . not n' u . 

where we primed the second rule to distinguish its atoms. Now, by repeated appli- 
cation of Marek and Subrahmanian lemma (Gel fond and Lifschitz 199l| . we con- 
clude that oi, . . . a r 6l and a[, . . . a' t el By the antichain property, {a\, . . . a r } 
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and {a' l7 . . . a' t } are disjoint. Thus, for each ai we have dj ^ {a[, . . . a' t } and then 
by construction al € I; but this means that condition not n x with n x = <Zj is false. 
Therefore, the second rule cannot fire. A similar argument applies for an arbitrary 
a'j {ai, . . .a r }. 

So, each answer set corresponds to the truth of exactly one body of the definition 
of m. In turn, each such body corresponds, up to Ti. (i.e., in all but the special atom 
m), to a component of A, as intended. 

It remains easy to check that for any A, the resulting tt_a is in kernel format. This 
concludes the proof. □ 

Corollary 1 

Any logic program II admits an equivalent — w.r.t. answer set semantics — program 
in kernel format. 

The theorem above establish that kernel program is a normal form, but it does 
not suggest a straightforward, efficient computational mechanism for answer set 
computing. The main reason is that what is taken into account, i.e., anti-chains 
over Ti, may be of cardinality exponential w.r.t. that of Ti. However, it is important 
as far as establishing that kernel equivalents always do exist, and hence that Kernel 
is a normal form. 



4 The 3-kernel normal form 

In this Section we define a variation of the kernel normal form that we call 3-kernel 
normal form, where rules have at most two conditions. The more restrictive 3-kernel 
form has in our view two advantages, i.e., it 

1. allows the study of program consistency in a mathematically elegant way and 

2. might help to discover efficient algorithms and heuristics for answer set com- 
putation. 

Also, we hope that it will become a useful program representation in the search 
for new algorithms/heuristics. We expect the 3-kernel normal form to play for ASP 
the same role that 3SAT plays for propositional logic, i.e., a streamlined formula 
presentation that is generally used for the input of satisfiability solvers and model 
checkers. In our related work fBrignoli et al. 1999| ) IjCostantini 2004b}) . we have 
argued that the existence of answer sets depends on both the cycles which are 
present in the program, and their connections. Indeed, the 3-kernel form is defined 
with the aim of making cyclic definitions and their connections thereof explicit. 
Before defining the 3-kernel normal form, it is useful to report some definitions 
from HCostantini 2004a|) . 



Definition 3 
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A set of rules C is called a cycle if it has the following form: 

Ai <— not A2, Ai 
A 2 <— not A3, A 2 

A„ <— not Ai, A„ 

where Ai, . . . , A„ are distinct atoms. Each A,-, i < n, is a possibly empty conjunction 
5^,... ,Si h of conditions (positive or negative), where ih > and for each literal 
Sij E Ai, ij < ih, 5 ij ^ \ and 8i j ^ not Ai. The Aj's are called AND handles of 
the cycle. 

We say that A, from Definition above is an AND handle for atom Ai, or, equiv- 
alently, an AND handle referring to Ai. Cycles of length n = 1 correspond to 
self-loops Ai <— not Ai, Ai, which are critical to determine consistency. We will say 
that C has size n and it is even (respectively odd) if n = 2k, k > 1 (respectively, 
n = 2k + 1, k > 0). The A's are the atoms involved in cycle C, or, equivalently, the 
composing atoms of the cycle. Rules belonging to a cycle C are said to be involved 
in, or belong to, or form the cycle C. 

Definition 4 

A rule is called an auxiliary rule of cycle C (or, equivalently, to cycle C) if it is of 
this form: 

Ai «- A 

where Ai is involved in cycle C, and A is a non-empty conjunction 5i ± , . . . , Si h of 
literals (either positive or negative), where ih > 1 and for each Si j , ij < ih, Si j ^ Ai 
and 8i j 7^ not Ai. A is called an OR handle of cycle C (more specifically, an OR 
handle for Ai or, equivalently, and OR handle referring to Ai). 

A cycle may have several auxiliary rules. Hence, a cycle may have some AND 
handles, occurring in one or more of the rules that form the cycle itself, and also 
some OR handles, occurring in its auxiliary rules. Handles are seen as connections 
between cycles. In order to make these connections explicit, it is useful that: 

• each handle be composed of just one atom, not belonging to the cycle, and 

• bridges should be as short as possible, even at the expense of re-introducing 
some positive literals. 

These requirements are fulfilled by the definition of 3-kernel normal form. 

Definition 5 

A logic program n is in 3-kernel normal form (or, equivalently, II is a 3-kcrncl 
program) if the following conditions hold. 

1. n is WFS-irreducible; 

2. every atom occurring in II is involved in some cycle; 
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3. each rule of II is cither involved in a cycle, or is an auxiliary rule of some 
cycle, or both; 

4. the body of each rule of II which is involved in a cycle consists of either one 
or two literals; 

5. each atom occurring in the AND handle of a cycle is not involved in that 
cycle; 

6. the body of each rule of II which is an auxiliary rule of some cycle consists of 
exactly one literal. 



In fact, as mentioned above all atoms in a kernel program are either part of a 
cyclic definition or defined (directly or indirectly) using atoms that are part of a 
cycle. However, allowing handles composed of several atoms means that a handle 
constitutes a connection between the cycle it refers to, and several other cycles. 
Also, connections between cycles can be defined indirectly, by means of a chain of 
dependencies. 

In the view of studying program consistency in terms of a Cycle Graph (|Costantini 2004b ) 
where vertexes are cycles and edges are connections between cycles, the 3-kernel 
form guarantees that each handle consists of just one condition, and thus it corre- 
sponds exactly to a connection between cycle C and one other cycle C . This leads 
to a much cleaner representation of the program. 



4-1 3-kernelization 

The aim of this section is to show how to transform a kernel program into a 3-kernel 
program. Given a kernel program, every rule with non-unit body can be eliminated, 
by transforming it into a cycle. Consider for instance program Tr$ : 

p *— not p. 

p «— not a, not c. 

a <— not b. 

b i— not a. 

c *— not d. 

d <— not c. 

Program 775 would be in 3-kernel format but for the second rule. In fact, it is an 
auxiliary rule of cycle {p <— notp}, but it does not match the definition of 3-kernel 
as its body consists of two literals instead of one. This rule is then transformed as 
follows: 
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p <— not p. 

p <— not h\. 

hi <— not h 2 . 

h 2 <— not h 3 , not a. 

h 3 <— not hi. 

h\ <— not /i 5 , not c. 

h5 <— not p. 

a <— not 6. 

6 <— not a. 

c <— not ct. 

<i <— not c. 

The latter program is equivalent to tt^ up to the language of itself. The long 
auxiliary rule has been replaced by a new cycle. More generally, the following trans- 
formation can be applied. 

Definition 6 {Long Rules simplification) 

Let II be a program in kernel normal form. The Long Rule simplification IT of LI 
is created as follows. Each rule 

p : h <— not bi, . . . , not bj 

occurring in LT that is either 

1. auxiliary to a cycle C, with j > 1, or 

2. involved in a cycle C, and j > 2 

is substituted in LT' by the set of rules (that form a new cycle): 

h <— not h\. 

hi <— not h 2 . 

h 2 <— not hs, not bi. 

hs <— not h^. 

hi <— not hs, not b 2 . 

h 2 j <— not h 2 j + i, not bj. 
h2j+i not h. 

where hi, ... , h 2 j+i are fresh atoms, i.e., do not appear in LT. 

By discarding the-truth value of the fresh atoms, one can check that the truth 
conditions for h remain the same as before. 

As far as complexity is concerned, we notice that a long rule has at most n = \A\ 
literals in the body, where \A\ is the number of atoms occurring originally in LT. 
For each condition appearing in the original rule we introduce two new atoms and 
two new rules. Then we introduce a final rule to close the new cycle. Thus, for each 
long rule we add at most 2n atoms and 2n + 1 rules. In the worst case, i.e., when 
we have to apply the transformation to all rules of II, we add n • (2 • n) new atoms 
and n ■ ((2 ■ n) + 1) new rules. 
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Since every atom occurring in II has at least one denning rule, then A is less then 
or equal to to = |IT|, i.e., to the number of rules of II. Hence, the new program after 
Long rule simplification has at most 2to 2 new atoms and 2to 2 + to new rules. 

4-2 Bridge elimination and other useful equivalences 

Even after performing the previous transformation, the program may still not be 
in 3-kcrncl form. In fact, it may contain rules that do not belong to any cycle nor 
are they auxiliary to a cycle. Such rules are said to from bridges, seen as paths 
connecting cycles. Bridges can be eliminated without affecting the semantics, at 
the price of dropping some atoms. The truth values of the dropped atoms can be 
reconstructed at a later stage since it can be proved that the truth value of any 
single atom of a bridge determines the truth values of the each other atom of the 
bridge. Elimination of bridges will now be discussed by case analysis. In cases (i) and 
(ii) we discuss how to eliminate bridges that originate in OR handles (called OR- 
bridges). In cases (iii) and (iv) we discuss how to eliminate bridges that originate 
in AND handles (call AND-bridges). 

(i) Consider a set of rules of the form: 

p <— not p. 
p <— not e. 
e <— not f. 
f <— not a. 
a <— not b. 
b <— not a. 

that corresponds to a bridge between cycles {p <— not p} and {a <— not b. b <— 
not a.} via an OR handle. In fact, p depends on not e (first rule, or first step, of 
the bridge), e depends on not f (second step), / depends on not a (third step). 
Since the bridge originates in an OR handle, it will be called an OR-bridge. The 
bridge involves three rules, i.e. three dependencies. Apart from p, but it involves 
two atoms, namely e and /. Based on the number of atoms, we say that it is an 
OR-bridge of even length. To the extent of checking consistency, this set of rules 
can be substituted by the following set. 

p <— not p. 
p <— not a. 
a <— not b. 
b <— not a. 

Clearly, the latter program is equivalent to the former up to the language. In the 
latter one however there is no bridge, rather a direct connection between the two 
cycles via the OR handle not a. The truth value of atoms e and / can be easily 
reconstructed from each answer set of the latter program. Moreover, the transfor- 
mation seen above can be applied whenever an even number of atoms are used to 
form a bridge. 

(ii) Consider a set of rules of the form: 
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p <— not p. 
p <— not e. 
e <— not f. 
f <- not g. 
g <— not a. 
a <— not 6. 
b <— not a. 

where we can see an OR-bridgc of odd-length between cycles {p <— not p} and 
{a <— not b. b <— not a.} involving atoms e, /, and g. To the extent of checking 
consistency, this set can be substituted by the following: 

p <— not p. 
p <— a. 
a <— not b. 
b <— not a. 

The two set of rules, seen as programs, are equivalent. The price we pay to ridding 
programs of this type of rules is the introduction of a positive condition. The formal 
definition of OR-bridgc follows. 

Definition 7 (OR-bridge) 

An OR-bridge is a set of rules 

p <— not q. 
p <— not X\. 
Ai <— not A2. 

A„ <— not a. 

where the first rule belongs to some cycle C where p and q are defined, the second 
rule is auxiliary to C and a is defined in some other cycle. An OR-bridge is of even 
(resp. odd) length, (even OR-bridge for short) if n is even (resp. odd). 

Definition 8 {even OR-bridge simplification) 

An even OR-bridge can be substituted by the set of rules: 

p <— not q. 
p <— not a. 

Definition 9 (odd OR-bridge simplification) 

An odd OR-bridgc can be substituted by the set of rules: 

p <— not q. 
p <— a. 

(iii) Consider a set of rules of the form: 
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p <— not p, not e 
e <— not f. 
f <— not a. 
a <— not b. 
b <— not a. 

that corresponds to a bridge between cycles {p <— not p} and {a <— not b. b <— 
not a.} via an AND handle. In fact, p depends on not e (first rule, or first step, of 
the bridge), e depends on not / (second step), / depends on not a (third step). Since 
the bridge originates in an AND handle, it will be called an OR-bridge. The bridge 
involves three rules, i.e. three dependencies. Apart from p, it involves two atoms, 
namely e and /. Based on the number of atoms, we say it is an AND-bridge of even 
length. In an AND-bridge, the first dependency that forms the bridge occurs in a 
rule belonging to a cycle, rather than in an auxiliary rule like for an OR-bridge. To 
the extent of checking consistency, this set can be substituted by the following set. 

p <— not p, not a. 
a <— not b. 
b <— not a. 

Clearly, the transformation is equivalence-preserving up to the language, while the 
truth value of omitted atoms (namely the intermediate atoms of the bridge: e and 
/) can be deterministically obtained from the previous ones. In the latter program 
however there is no bridge, rather a direct connection between the two cycles via 
the AND handle not a. 

(iv) Consider a set of rules of the form: 

p <— not p, not e 
e <— not f. 
f <- not g. 
g <— not a. 
a <— not b. 
b <— not a. 

where we can see an AND-bridge of odd length between cycles {p <— not p] 
and {a <— not b.b <— not a.}, involving atoms e,f,g. To the extent of checking 
consistency, this set of rules can be substituted by the following set. 

p <— not p, a. 
a <— not b. 
b <— not a. 

Let us make intuition about AND bridges formal. 

Definition 10 (AND-bridge) 
An AND-bridge is a set of rules 
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p <— not q, not X±. 
Ai <— not \ 2 . 

A n <— not a. 

where the first rule belongs to cycle a C, the second rule is auxiliary to C and 
atom a is defined in another cycle C<i. An AND-bridgc is of even (resp. odd) length 
(even AND- bridge, for short) if n is even (resp. odd). 

Definition 11 (even AND-bridge simplification) 

An even AND-bridge can be substituted by the set of rules: 

p <— not q, not a. 

Definition 12 (odd AND-bridge simplification) 

An odd AND-bridge can be substituted by the set of rules: 

p <— not q, a. 

The OR and AND bridge simplification can clearly be done in reasonable time, 
and have the effect of decreasing the size of the program. After applying the above 
transformations, we have obtained the 3-kernel normal form of the original program 
IT. From the above reasoning, we are lead to the following general conclusions. 

Theorem 2 

For a given program n, its 3-kernel normal form fl = 3fcer(n) is obtained from its 
kernel normal form fcer(II) via the application of (i) long rule simplification and 
(ii) bridge simplification. 

Theorem 3 

For any given program II, the answer sets of the 3-kernel normal form 3fcer(II) 
correspond, up to the language, to those of II. 

Therefore, the answer sets of an arbitrary program II can be obtained by applying 
3-kernelization and then expanding each answer set over the language of II. Notice 
that the bridge simplification reintroduces positive atoms. The 3-kernel form in 
fact admits positive handles. The different kinds of bridges between cycles that a 
program may contain after the simplification introduced above, are illustrated by 
the following example. 

Example 2 

Let ttq be the 3-kernel program: 

a <— not b. 

b <— not a. 

p <— not p, not b. 

q <— not q. 

q <— not a. 
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We can distinguish the even cycle EC\ = {a <— not b. b <— not a} and the odd 
cycles OCi — {p <— not p} and OC2 = {q <— not q}. There is an AND-bridge 
from ECi and OC\, corresponding to the AND handle not b of OC\. There is an 
OR-bridge from EC\ and OC2, corresponding to the OR handle not a of OC2. In 
these handles, that form bridges, is the key of the existence of the answer set {b, q}. 
In fact, the odd cycles if taken alone would be inconsistent. The even cycle instead 
has the two answer sets {a}, {&}. By choosing {&}, the odd cycles are constrained 
by the handles and become consistent. 

5 Discussion 

The study of consistency of ASP programs led us to the definition of kernel and 
3-kernel normal forms. Together with the introduction of the Extended Dependency 
Graph (EDG) program representation in (Bri gnoli et al. 1999| ), these normal forms, 
and the formal results presented here, provide a suitable theoretical framework for 
the study of program properties and the development of an ASP model theory. 

It could be noticed that stating 3-colorability, an NP-complete problem, as a 
kernel program (whose size is polynomial in the size of the considered graph) already 
amounts to a proof of NP completeness for kernel. However, such equivalence is 
indirect and not suggestive of the connection existing between a general programs 
and its kernel counterparts. 

Indeed, the kernel normal form may yield advantages in terms of design and im- 
plementation of ASP computations. First, regularities are exploited in a more con- 
cise program representation. This fact has been exploited, for instance, in the appli- 
cation of neuro-genetic approximation methods to ASP computation in IjBertoni et al. 2 001). 
Second, existing polynomial-time program simplifications are efficiently integrated 
in the answer set computation process and this process is led by semantics consid- 
erations throughout. Finally, the 3-kernel form, is reminiscent of predicate binariza- 
tion, a parallelization technique that has been in studied in the context of parallel 
Prolog program execution, and suggests an approach to the parallel computation 
of answer sets, an issue that only recently has received attention. 

A subject of future research is to investigate whether relevant open problems, such 
as program equivalence, and safe program composition, can be usefully rephrased 
in terms of 3-kernel programs. 
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